AITopics | generate caption

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > China > Hong Kong (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

Neural Information Processing SystemsNov-20-2025, 18:08:51 GMT

A Neural Compositional Paradigm for Image Captioning

Bo Dai, Sanja Fidler, Dahua Lin

Neural Information Processing Systems http://nips.cc/

caption, machine learning, natural language, (21 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > China > Hong Kong (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

EngadgetSep-24-2024, 16:14:33 GMT

Warner Bros. Discovery teams up with Google to generate captions using AI

Warner Bros. Discovery (WBD) has agreed a deal with Google Cloud to use the latter's Vertex AI to generate captions for programming across a variety of platforms. WBD claims that its Caption AI system can significantly reduce production time and costs while improving the accuracy of captions for US-based viewers. The tech will be used for unscripted programming at the outset, which could include news, sports and reality TV across the likes of Max, CNN and Discovery . WBD claims the system can reduce the time it takes to create captions by up to 80 percent and captioning costs by up to 50 percent. There will still be a level of human review for quality assurance, and the company claims this approach will help refine and train Caption AI's workflow to improve it over time.

caption, generate caption, warner bro, (6 more...)

Engadget

Industry:

Leisure & Entertainment (0.96)
Media > Film (0.63)

Technology: Information Technology > Artificial Intelligence > Applied AI (0.40)

#artificialintelligenceMar-15-2023, 00:05:09 GMT

CLIP for Language-Image Representation

Have you ever wondered how machines can understand the meaning behind a photograph? CLIP, the Contrastive Language-Image Pre-training model, is changing the game of image-language understanding. In this post, we will explore why CLIP is so stunning with its ability. We have seen AI's potential to solve many problems in our world. The famous AI models such as ChatGPT, LLaMA, or DALLE, etc., changing our lives (In a good way, I suppose) are direct evidence.

downstream task, language-image representation, representation, (14 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.52)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.37)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.37)

#artificialintelligenceApr-24-2021, 18:28:47 GMT

Using Machine Learning to generate captions for Images

The first and foremost step of any Machine Learning Program is to clean the data and get rid of any unwanted data. As we are dealing with text data in the captions, we will perform basic cleaning steps like converting all letters to lowercase as for a computer'Hey' and'hey' are two completely different words, removing special tokens and punctuation marks such as *, (, £, $, %, etc, and eliminating any words that contain numbers. We first create a vocabulary for all the unique in our dataset i.e, 8000 (no. of pictures) * 5 ( captions for each image) 40000 captions. We found that to be equal to 8763. But most of these words occurred just one or two times and we would not want them in our model as it will not make our model robust to outliers. Hence we set a threshold of 10 minimum occurrences of a word to be included in our vocabulary and that turns out to be equal to 1652 unique words.

caption, generate caption, machine learning, (1 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.64)

#artificialintelligenceJul-9-2020, 15:08:21 GMT

A system to produce context-aware captions for news images

Computer systems that can automatically generate image captions have been around for several years. While many of these techniques perform considerably well, the captions they produce are typically generic and somewhat uninteresting, containing simple descriptions such as "a dog is barking" or "a man is sitting on a bench." Alasdair Tran, Alexander Mathews and Lexing Xie at the Australian National University have been trying to develop new systems that can generate more sophisticated and descriptive image captions. In a paper recently pre-published on arXiv, they introduced an automatic captioning system for news images that takes the general context behind an image into account while generating new captions. The goal of their study was to enable the creation of captions that are more detailed and more closely resemble those written by humans.

caption, machine learning, natural language, (17 more...)

Industry: Media > News (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

#artificialintelligenceOct-15-2019, 09:25:09 GMT

How to Generate Text from Images with Python

In the Google Search: State of the Union last May, John Mueller and Martin Splitt spent about a fourth of the address to image-related topics. They announced a big list of improvements to Google Image Search and predicted that it would be a massive untapped opportunity for SEO. SEO Clarity, an SEO tool vendor, released a very interesting report around the same time. Among other findings, they found that more than a third of web search results include images. Images are important to search visitors not only because they are visually more attractive than text, but they also convey context instantly that would require a lot more time when reading text.

caption, image url, neural network, (16 more...)

Industry: Information Technology > Services (0.35)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Neural Information Processing SystemsDec-31-2018

A Neural Compositional Paradigm for Image Captioning

Dai, Bo, Fidler, Sanja, Lin, Dahua

Mainstream captioning models often follow a sequential structure to generate cap- tions, leading to issues such as introduction of irrelevant semantics, lack of diversity in the generated captions, and inadequate generalization performance. In this paper, we present an alternative paradigm for image captioning, which factorizes the captioning procedure into two stages: (1) extracting an explicit semantic representation from the given image; and (2) constructing the caption based on a recursive compositional procedure in a bottom-up manner. Compared to conventional ones, our paradigm better preserves the semantic content through an explicit factorization of semantics and syntax. By using the compositional generation procedure, caption construction follows a recursive structure, which naturally fits the properties of human language. Moreover, the proposed compositional procedure requires less data to train, generalizes better, and yields more diverse captions.

caption, machine learning, natural language, (21 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > China > Hong Kong (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

Neural Information Processing SystemsDec-31-2018

A Neural Compositional Paradigm for Image Captioning

Dai, Bo, Fidler, Sanja, Lin, Dahua

caption, machine learning, natural language, (21 more...)

Country: North America > Canada (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

#artificialintelligenceApr-14-2018, 22:17:12 GMT

Building an image caption generator with Deep Learning in Tensorflow

In my last tutorial, you learned how to create a facial recognition pipeline in Tensorflow with convolutional neural networks. In this tutorial, you'll learn how a convolutional neural network (CNN) and Long Short Term Memory (LSTM) can be combined to create an image caption generator and generate captions for your own images. In 2014, researchers from Google released a paper, Show And Tell: A Neural Image Caption Generator. At the time, this architecture was state-of-the-art on the MSCOCO dataset. It utilized a CNN LSTM to take an image as input and output a caption.

caption generator, lstm, tensorflow, (13 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)